Inferring single-cell gene expression mechanisms using stochastic simulation

نویسندگان

  • Bernie J. Daigle
  • Mohammad Soltani
  • Linda R. Petzold
  • Abhyudai Singh
چکیده

MOTIVATION Stochastic promoter switching between transcriptionally active (ON) and inactive (OFF) states is a major source of noise in gene expression. It is often implicitly assumed that transitions between promoter states are memoryless, i.e. promoters spend an exponentially distributed time interval in each of the two states. However, increasing evidence suggests that promoter ON/OFF times can be non-exponential, hinting at more complex transcriptional regulatory architectures. Given the essential role of gene expression in all cellular functions, efficient computational techniques for characterizing promoter architectures are critically needed. RESULTS We have developed a novel model reduction for promoters with arbitrary numbers of ON and OFF states, allowing us to approximate complex promoter switching behavior with Weibull-distributed ON/OFF times. Using this model reduction, we created bursty Monte Carlo expectation-maximization with modified cross-entropy method ('bursty MCEM(2)'), an efficient parameter estimation and model selection technique for inferring the number and configuration of promoter states from single-cell gene expression data. Application of bursty MCEM(2) to data from the endogenous mouse glutaminase promoter reveals nearly deterministic promoter OFF times, consistent with a multi-step activation mechanism consisting of 10 or more inactive states. Our novel approach to modeling promoter fluctuations together with bursty MCEM(2) provides powerful tools for characterizing transcriptional bursting across genes under different environmental conditions. AVAILABILITY AND IMPLEMENTATION R source code implementing bursty MCEM(2) is available upon request. CONTACT [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Gene expression Inferring single-cell gene expression mechanisms using stochastic simulation

Motivation: Stochastic promoter switching between transcriptionally active (ON) and inactive (OFF) states is a major source of noise in gene expression. It is often implicitly assumed that transitions between promoter states are memoryless, i.e. promoters spend an exponentially distributed time interval in each of the two states. However, increasing evidence suggests that promoter ON/OFF times ...

متن کامل

SINCERITIES: inferring gene regulatory networks from time-stamped single cell transcriptional expression profiles

Motivation Single cell transcriptional profiling opens up a new avenue in studying the functional role of cell-to-cell variability in physiological processes. The analysis of single cell expression profiles creates new challenges due to the distributive nature of the data and the stochastic dynamics of gene transcription process. The reconstruction of gene regulatory networks (GRNs) using singl...

متن کامل

Semi-quantitative Analysis of Expression of Various Genes in relation to Possible Markers for Theileria annulata Attenuation

  The sporozoites of Theileria annulata invade bovine MHC II cells, where they differentiate into schizonts. The later can immortalize and induce fundamental changes in their host cells. Live attenuated vaccine is an important way of controlling T. annulata infection of cattle. Production is by prolonged cultivation of macroschizont-infected cells. The mechanisms underlying this transformation ...

متن کامل

Therapeutic Efficacy Analysis of lncRNA NEAT1 Gene Knockout and Apoptosis Induction in Prostate Cancer Cell Line Using CRISPR/Cas9

Background and Objective: Long non-coding ribonucleic acid (lncRNA) has been identified as an important gene regulator and prognostic marker in various cancers. The present study aimed to investigate the effects of Nuclear Paraspeckle Assembly Transcript1 (NEAT1) gene knockout using Clustered Regularly Interspaced Short Palindromic Repeats-associated Protein 9 (CRISPR/Cas9) in PC-3 cell line. ...

متن کامل

A Hierarchical Bayesian Model for Inference of Copy Number Variants and Their Association to Gene Expression By

A number of statistical models have been successfully developed for the analysis of high-throughput data from a single source, but few methods are available for integrating data from different sources. Here we focus on integrating gene expression levels with comparative genomic hybridization (CGH) array measurements collected on the same subjects. We specify a measurement error model that relat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 31 9  شماره 

صفحات  -

تاریخ انتشار 2015